A 



6 

269.80 2 5 4 

229.80 6 5 1 

259.85 4 3 2 

179.05 4 2 0 



1 3 BTB_0304 9 

2 4 SPB_00387 
1 1 EN_00236 
0 2 NRB 00265 



B 



Fig. 1 




* 

Calculating 6-dimensional descriptor X t for 
each molecule M, 



Selecting arbitrarily 
400 molecules M, 
as initial population 

(subset Su) 



Genetic Algorithm 
(Fig.ZB) 
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f 


Outputting the m( 
ordered by the if 


)lecules M, of Su 
fitnesses f(M,) 



jr. 




Fig. 2A 
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Fig. 2A J 



Calculating fitness f(M ( ) for 
each molecule M, of the 
population Su (400 
molecules) 



Selecting from Su 
100 couples of parent 
molecules with a Roulette 
Wheel algorithm 



Producing 200 descendants MX, MY by crossover: 
Generating random number p; 0 < p £1 
descendant 1 (daughter): 
MX=(1-p) * (father genome X) + p * (mother genome Y) 

descendant 2 (son): 
MY=p * (father genome X) + (1-p) * (mother genome Y) 



Mutating the genomes of the 200 

descendants with a given 
probability by adding a random 
value r to each element x 



Searching the most similar molecule i 
to the descendants in the data base !> 



A. 



Calculating the fitness f (i.e., 
affinity) for each descend ant MX, 



Putting the descendants MX, MY 
into Su 



Removing the 200 individuals 
with the lowest fitness values 
from Su 
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descendant 1 (son) 
descendant 2 (daughter) 
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0,48| 
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0,42 
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0,26 
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Sum = 5,8 



0,821 



son = p x (father_jjenom)+(1-p) x (mother_genom) 
daughter = (1-p) x (fathfer_genom)+p x (mother_genom) 



p=0.2 
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Activity in evaluated subset 




Evaluated molecules per generation 




Fig. 7 



